S/N 10/005,314



PATENT 



IN THE UNITED STATES PATENT AND TRADEMARK OFFICE 



In re Applicant: Baruch, et al. 
Serial No.: 10/005,314 
Filed: December 7, 2001 

Title: VOICE CONTROL SYSTEM WITH MULTIPLE VOICE
RECOGNITION ENGINES

Group Art Unit: 2655 
Examiner: B. Albertall 

REQUEST TO CORRECT INVENTORSHIP UNDER 37 CFR § 1.48(a) 



Commissioner for Patents 
P.O. Box 1450 
Alexandria, VA 22313-1450 

Sir: 

It has been found that the above-referenced patent application, through error 
and without deceptive intent, currently improperly sets forth the inventorship of the 
invention claimed therein. Accordingly, Applicant hereby respectfully requests that the 
above-referenced application be corrected in compliance with 37 CFR § 1.48(a), and 
that the following person be added as inventor: 

Eran Aharonson, a citizen of Israel, residing at 27 Hasaifan St., Ramat 
Hasharon, Israel. 

Please find included herewith: 

• A statement from each person being added as an inventor that the error in 
inventorship occurred without deceptive intention on his or her part; 

• An oath or declaration by the actual inventor or inventors; 

• A Revocation/Power of Attorney by the assignee; 

• The written consent of the assignee; 

• A statement under 37 CFR § 3.73(b). 



The processing fee set forth in § 1.17(i) is believed to be due in connection
with the filing of this request. The United States Patent and Trademark Office is 
therefore hereby authorized to charge Deposit Account No. 501380 in the amount of 
One Hundred and Thirty Dollars ($130.00) and any other fees that may be required for 
entry of this request. 



Respectfully submitted, 

Daniel J. Swirsky 
Representative for Applicant 
Registration No. 45,148 



DANIEL J. SWIRSKY 
ALPHAPATENT ASSOCIATES LTD. 
55 REUVEN 
P.O.B. 2345 

BEIT SHEMESH, 99544 
ISRAEL 

TEL. (US) 516-620-4572 
FAX. (US) 206-374-6672 



S/N 10/005,314 



PATENT 



IN THE UNITED STATES PATENT AND TRADEMARK OFFICE 

In re Applicant: Baruch, et al.
Serial No.: 10/005,314
Filed: December 7, 2001

Title: VOICE CONTROL SYSTEM WITH MULTIPLE VOICE
RECOGNITION ENGINES

Group Art Unit: 2655 
Examiner: B. Albertall 

STATEMENT BY INVENTOR UNDER 37 CFR § 1.48(a) 

Commissioner for Patents 
P.O. Box 1450
Alexandria, VA 22313-1450

Sir: 

It has been found that the above-referenced patent application currently 
improperly sets forth the inventorship of the invention claimed therein. I hereby respectfully 
request that the above-referenced application be corrected in compliance with 37 CFR
§ 1.48(a), and that I be added as inventor as follows:

Eran Aharonson, a citizen of Israel, residing at 27 Hasaifan St., Ramat Hasharon,
Israel.

I hereby state that the error in inventorship occurred without deceptive intent on 
my part. I hereby declare that all statements made herein of my own knowledge are true and 
that all statements made on information and belief are believed to be true; and further, that 
these statements were made with the knowledge that willful false statements and the like so 
made are punishable by fine or imprisonment, or both, under Section 1001 of Title 18 of the 
United States Code and that such willful false statements may jeopardize the validity of the 
application or any patent issued thereon.

Respectfully submitted, 






Attorney Docket No.: 1326-US 
DECLARATION AND POWER OF ATTORNEY FOR PATENT APPLICATION 



As a below named inventor, I hereby declare that: 

My residence, post office address and citizenship are as stated below under my
name.

I believe I am the original, first and sole inventor (if only one name is listed below), 
or an original, first and joint inventor (if plural names are listed below) of the subject matter 
which is claimed and for which a patent is sought on the invention entitled: 

VOICE CONTROL SYSTEM WITH MULTIPLE VOICE 
RECOGNITION ENGINES 

the Specification of which 

□ is attached hereto

☒ was filed on December 7, 2001

as Application Serial No. 10/005,314 

and was amended on (if applicable). 

I hereby state that I have reviewed and understand the contents of the above- 
identified Specification, including the claims, as amended by any amendment referred to 
above. 

I acknowledge the duty to disclose information which is material to the examination 
of this application in accordance with Title 37, Code of Federal Regulations, 1.56(a). 

PRIORITY CLAIM - NON-U.S. OR PCT PATENT APPLICATION(S) 

I hereby claim priority benefits under 35 U.S.C. 119(a)-(d) or 35 U.S.C. 365(b) of
any foreign application(s) for patent or inventor's certificate listed below, or 35 U.S.C. 
365(a) of any PCT international application listed below which designated at least one 
country other than the United States of America, and have also listed below to my actual 
knowledge any foreign application for patent or inventor's certificate, or any PCT 
international application having a filing date before that of the application on which priority 
is claimed: 

APPLICATION    COUNTRY/      DAY/MONTH/YEAR FILED    PRIORITY
NUMBER         CONVENTION                            CLAIMED

PRIORITY CLAIM - U.S. PROVISIONAL PATENT APPLICATION(S) 



I hereby claim priority benefits under 35 U.S.C. §119 of any U.S. Provisional
Patent Application listed below that has been filed in the United States in accordance with
35 U.S.C. §119(e), or any U.S. Patent Application listed below that has been converted to a
U.S. Provisional Application within one (1) year of its filing date: 

APPLICATION    DAY/MONTH/YEAR FILED    PRIORITY
NUMBER                                 CLAIMED

60/254,644     7 December 2000         YES



PRIORITY CLAIM - U.S. PATENT APPLICATIONS 

I hereby claim the benefit under 35 U.S.C. §120 of any U.S. Patent Application 
listed below, and, insofar as the subject matter of each of the claims of this application is 
not disclosed in any prior U.S. Patent Application in the manner provided by the first 
paragraph of 35 U.S.C. §112, I acknowledge the duty to disclose material information as
defined in 37 CFR § 1.56(a), which occurred between the filing date of the prior application
and the national or PCT international filing date of this application: 

APPLICATION DAY/MONTH/YEAR FILED STATUS - PATENTED, 
NUMBER PENDING, ABANDONED 



DESIGNATION OF ATTORNEY OR AGENT 

I hereby appoint Daniel J. Swirsky (Agent, Registration No. 45,148) and Heidi M. 
Brun (Agent, Registration No. 34,504) and practitioners at Customer Number 24505 as 
my/our attorney(s) or agent(s) with full power of substitution and revocation to prosecute 
the above-identified application and transact all business connected therewith in the United 
States Patent and Trademark Office. 

Please address all correspondence regarding this application to Customer No. 
24505, being: 

DANIEL J. SWIRSKY 
ALPHAPATENT ASSOCIATES LTD. 
P.O.B. 2345 
BEIT SHEMESH, ISRAEL 99544 

Please direct all telephone calls to (US) (516) 620-4572, all facsimiles to 
(800) 243-2384, and all e-mail messages to dswirsky@alphapatent.com.



I hereby declare that all statements made herein of my own knowledge are true and 
that all statements made on information and belief are believed to be true; and further, that 
these statements were made with the knowledge that willful false statements and the like so 
made are punishable by fine or imprisonment, or both, under Section 1001 of Title 18 of the 
United States Code and that such willful false statements may jeopardize the validity of the
application or any patent issued thereon. 

NAME OF INVENTOR: BARUCH, Amit 

ADDRESS: 10 Eliahu Hakim Street, Tel Aviv 69120, Israel 

COUNTRY OF CITIZENSHIP: Israel 




DATE 






NAME OF INVENTOR: MOCHARY, Ran

ADDRESS: 16 Shikmim Street, Nes Ziona, Israel

COUNTRY OF CITIZENSHIP: Israel

SIGNATURE OF INVENTOR

DATE



NAME OF INVENTOR: RIEMER, Itay

ADDRESS: 14 Kakal Street, Givataim 53237, Israel

COUNTRY OF CITIZENSHIP: Israel

SIGNATURE OF INVENTOR

DATE





I hereby declare that all statements made herein of my own knowledge are true and 
that all statements made on information and belief are believed to be true; and further, that 
these statements were made with the knowledge that willful false statements and the like so 
made are punishable by fine or imprisonment, or both, under Section 1001 of Title 18 of the 
United States Code and that such willful false statements may jeopardize the validity of the 
application or any patent issued thereon. 

NAME OF INVENTOR: BARUCH, Amit 

ADDRESS: 10 Eliahu Hakim Street, Tel Aviv 69120, Israel 

COUNTRY OF CITIZENSHIP: Israel 

SIGNATURE OF INVENTOR 

DATE 



NAME OF INVENTOR: MOCHARY, Ran 

ADDRESS: 16 Shikmim Street, Nes Ziona, Israel

COUNTRY OF CITIZENSHIP: Israel 



SIGNATURE OF INVENTOR 




DATE 



NAME OF INVENTOR: RIEMER, Itay 

ADDRESS: 26 Kdoshei Shoah St., Tel-Aviv 69379, Israel

COUNTRY OF CITIZENSHIP: Israel 



SIGNATURE OF INVENTOR 

DATE

NAME OF INVENTOR: BEN-DOR, Nir

ADDRESS: 37/3 Kiriat Shmona Street, Holon 58550, Israel

COUNTRY OF CITIZENSHIP: Israel

SIGNATURE OF INVENTOR

DATE



NAME OF INVENTOR: YADID, Tal

ADDRESS: Hei Beiyar Street, Rosh Haain 48056, Israel

COUNTRY OF CITIZENSHIP: Israel

SIGNATURE OF INVENTOR

DATE



NAME OF INVENTOR: AHARONSON, Eran

ADDRESS: 27 Hasaifan Street, Ramat Hasharon 47248, Israel

COUNTRY OF CITIZENSHIP: Israel

SIGNATURE OF INVENTOR

DATE



NAME OF INVENTOR: BEN-DOR, Nir 

ADDRESS: 37/3 Kiriat Shmona Street, Holon 58550, Israel 

COUNTRY OF CITIZENSHIP: Israel 

SIGNATURE OF INVENTOR

DATE



NAME OF INVENTOR: YADID, Tal

ADDRESS: Hei Beiyar Street, Rosh Haain 48056, Israel

COUNTRY OF CITIZENSHIP: Israel

SIGNATURE OF INVENTOR

DATE



NAME OF INVENTOR: AHARONSON, Eran 

ADDRESS: 27 Hasaifan Street, Ramat Hasharon 47248, Israel 

COUNTRY OF CITIZENSHIP: Israel 



SIGNATURE OF INVENTOR 
DATE 
S/N 10/005,314



PATENT 



IN THE UNITED STATES PATENT AND TRADEMARK OFFICE 



In re Applicant: Amit BARUCH, et al.
Serial No.: 10/005,314 
Attorney Docket No.: 1326-US 
Filed: December 7, 2001 



Title: VOICE CONTROL SYSTEM WITH MULTIPLE VOICE
RECOGNITION ENGINES

Group Art Unit: 2655
Examiner: B. Albertall



REVOCATION/APPOINTMENT OF POWER OF ATTORNEY
OR AUTHORIZATION OF AGENT

Assistant Commissioner for Patents
Washington, D.C. 20231

Sir:

As a below-named authorized signatory empowered to sign this statement on behalf of
Assignee of Record:

ART - ADVANCED RECOGNITION TECHNOLOGIES LTD.

I hereby revoke all previous powers of attorney or authorizations of agent given in the above-
identified application and appoint practitioners at Customer Number 24505 and Daniel J.
Swirsky (Agent, Registration No. 45,148) and Heidi M. Brun (Agent, Registration No. 34,504) as
my/our attorney(s) or agent(s) with full power of substitution and revocation to prosecute the
above-identified application and transact all business connected therewith in the United States
Patent and Trademark Office.

A statement under 37 CFR 3.73(b) is enclosed.

Please address all correspondence regarding this application to:

CUSTOMER NUMBER 24505

being:

DANIEL J. SWIRSKY
ALPHAPATENT ASSOCIATES LTD.
P.O.B. 2345
BEIT SHEMESH, ISRAEL 99544

Please direct all telephone calls to (516) 620-4572, all facsimiles to (206) 374-6672,
and all e-mail correspondence to dswirsky@alphapatent.com.



AUTHORIZED SIGNATORY: James Arnold 



TITLE: Chief Financial Officer/Director

SIGNATURE:
DATE:
S/N 10/005,314



PATENT 



IN THE UNITED STATES PATENT AND TRADEMARK OFFICE 

In re Applicant: Baruch, et al.
Serial No.: 10/005,314
Filed: December 7, 2001

Title: VOICE CONTROL SYSTEM WITH MULTIPLE VOICE
RECOGNITION ENGINES

Group Art Unit: 2655 
Examiner: B. Albertall

WRITTEN CONSENT OF ASSIGNEE UNDER 37 CFR § 1.48(a)

Commissioner for Patents
P.O. Box 1450
Alexandria, VA 22313-1450

Sir: 

As a below-named authorized signatory empowered to sign this statement on behalf 
of Assignee of Record: 

ART - ADVANCED RECOGNITION TECHNOLOGIES LTD. 

I hereby consent to the addition of the following person as inventor: 

Eran Aharonson, a citizen of Israel, residing at 27 Hasaifan St., Ramat Hasharon,
Israel.

A statement under 37 CFR 3.73(b) is enclosed. 

AUTHORIZED SIGNATORY: James Arnold

TITLE: Chief Financial Officer/Director

SIGNATURE:

DATE:


U.S. Patent and Trademark Office; U.S. DEPARTMENT OF COMMERCE
Under the Paperwork Reduction Act of 1995, no persons are required to respond to a collection of information unless it displays a valid OMB control number.



STATEMENT UNDER 37 CFR 3.73(b) 

Applicant/Patent Owner: Amit BARUCH, et al.

Application No./Patent No.: 10/005,314    Filed/Issue Date: DECEMBER 7, 2001

Entitled: VOICE CONTROL SYSTEM WITH MULTIPLE VOICE RECOGNITION ENGINES

ART ADVANCED RECOGNITION TECHNOLOGIES LTD., a CORPORATION
(Name of Assignee) (Type of Assignee, e.g., corporation, partnership, university, government agency, etc.)

states that it is:

1. ☒ the assignee of the entire right, title, and interest; or

2. □ an assignee of an undivided part interest.
   The extent (by percentage) of its ownership interest is ____ %

in the patent application/patent identified above by virtue of either:

A. ☒ An assignment from the inventor(s) of the patent application/patent identified above. The assignment was recorded in the
   United States Patent and Trademark Office at Reel 012715, Frame 0495, or for which a copy thereof is attached.

OR

B. □ A chain of title from the inventor(s), of the patent application/patent identified above, to the current assignee as shown below:

   1. From: ____ To: ____
      The document was recorded in the Patent and Trademark Office at
      Reel ____, Frame ____, or for which a copy thereof is attached.

   2. From: ____ To: ____
      The document was recorded in the Patent and Trademark Office at
      Reel ____, Frame ____, or for which a copy thereof is attached.

   3. From: ____ To: ____
      The document was recorded in the Patent and Trademark Office at
      Reel ____, Frame ____, or for which a copy thereof is attached.

   □ Additional documents in the chain of title are listed on a supplemental sheet.

□ Copies of assignments or other documents in the chain of title are attached.
[NOTE: A separate copy (i.e., the original assignment document or a true copy of the original document)
must be submitted to Assignment Division in accordance with 37 CFR Part 3, if the assignment is to be
recorded in the records of the PTO. See MPEP 302.8]

The undersigned (whose title is supplied below) is empowered to sign this statement on behalf of the assignee.

Date

Signature

Typed or printed name

Title




S/N 10/005,314



PATENT 



IN THE UNITED STATES PATENT AND TRADEMARK OFFICE 



In re Applicant: Baruch, et al. 
Serial No.: 10/005,314 
Filed: December 7, 2001 

Title: VOICE CONTROL SYSTEM WITH MULTIPLE VOICE
RECOGNITION ENGINES

Group Art Unit: 2655 
Examiner: B. Albertall 

INFORMATION DISCLOSURE STATEMENT 



Commissioner for Patents 
P.O. Box 1450 
Alexandria, VA 22313-1450 

Sir: 

Entry and consideration of the enclosed Form 1449A - Information 
Disclosure Statement By Applicant is respectfully requested in accordance with 37 
C.F.R. § 1.97(c)(2), being filed by the Applicant before the mailing date of a final Office 
Action. 

A copy of each document or other information listed on the enclosed 
Information Disclosure Statement is provided in accordance with 37 C.F.R. § 1.98(a)(2). 

This submission does not represent that a search has been made or that 
information cited is, or is considered to be, material to patentability as defined in 37 
C.F.R. § 1.56(b), or that information cited is, or is considered to be, "prior art" within the
meaning of 35 U.S.C. §§ 102 and 103, or that information cited has been thoroughly 
reviewed or that any relevance of any portion of a reference is intended. Applicant
reserves the right to establish that any information cited is not "prior art," and that the
date of publication indicated for a cited item is in fact different than that which is 
indicated. 

Pursuant to the provisions of M.P.E.P. 609, it is requested that the Examiner
return a copy of the enclosed Information Disclosure Statement, marked as being 
considered and initialed by the Examiner, to the undersigned with the next official 
communication. 

The United States Patent and Trademark Office is hereby authorized to 
charge Deposit Account No. 501380 for the fee set forth in § 1.17(p) and any other fees
that may be required for entry of this paper. 

Respectfully submitted, 

Daniel J. Swirsky 
Representative for Applicant 
Registration No. 45,148 



ALPHAPATENT ASSOCIATES LTD. 
P.O.B. 2345 

BEIT SHEMESH, ISRAEL 99544 
TEL. (US) 516-620-4573 
FAX. (US) 800-243-2384 
dswirsky@alphapatent.com
PTO/SB/08A (10-01)
Approved for use through 10/31/2002. OMB 0651-0031 
U.S. Patent and Trademark Office: U.S. DEPARTMENT OF COMMERCE 
Under the Paperwork Reduction Act of 1995. no persons are required to respond to a collection of information unless it contains a valid OMB control number. 
Substitute for form 1449A/PTO

INFORMATION DISCLOSURE
STATEMENT BY APPLICANT

(use as many sheets as necessary)

Sheet 1 of

Complete if Known

Application Number: 10/005,314
Filing Date: December 7, 2001
First Named Inventor: Amit BARUCH, et al.
Group Art Unit: 2655
Examiner Name: B. Albertall
Attorney Docket Number: 1326-US



U.S. PATENT DOCUMENTS

Examiner   Cite  U.S. Patent Document           Publication Date  Name of Patentee or          Pages, Columns, Lines, Where Relevant
Initials*  No.   Number - Kind Code (if known)  MM-DD-YYYY        Applicant of Cited Document  Passages or Relevant Figures Appear

                 US-6,501,966                   12-31-2002        Bareis et al.

FOREIGN PATENT DOCUMENTS

Examiner   Cite  Foreign Patent Document           Publication Date  Name of Patentee or          Pages, Columns, Lines, Where Relevant   T
Initials*  No.   Country Code - Number - Kind Code  MM-DD-YYYY        Applicant of Cited Document  Passages or Relevant Figures Appear
                 (if known)


Examiner Signature:                    Date Considered:

1 Applicant's unique citation designation number (optional). 2 See Kinds Codes of USPTO Patent Documents at www.uspto.gov or MPEP
901.04. 3 Enter Office that issued the document, by the two-letter code (WIPO Standard ST.3). 4 For Japanese patent documents, the
indication of the year of the reign of the Emperor must precede the serial number of the patent document. 5 Kind of document by the 
appropriate symbols as indicated on the document under WIPO Standard ST. 16 if possible. 6 Applicant is to place a check mark here if 
English language Translation is attached. 

Burden Hour Statement: This form is estimated to take 2.0 hours to complete. Time will vary depending upon the needs of the individual case. 
Any comments on the amount of time you are required to complete this form should be sent to the Chief Information Officer, U.S. Patent and 
Trademark Office, Washington, DC 20231. DO NOT SEND FEES OR COMPLETED FORMS TO THIS ADDRESS. SEND TO: Assistant 
Commissioner for Patents, Washington, DC 20231. 

*EXAMINER: Initial if reference considered, whether or not citation is in conformance with MPEP 609. Draw line through citation if not in 
conformance and not considered. Include copy of this form with next communication to applicant. 



PTO/SB/08B (08-00) 
Approved for use through 10/31/2002. OMB 0651-0031 
U. S. Patent and Trademark Office: U.S. DEPARTMENT OF COMMERCE 
Under the Paperwork Reduction Act of 1995, no persons are required to respond to a collection of information unless it contains a valid OMB control number.

Substitute for form 1449B/PTO

INFORMATION DISCLOSURE
STATEMENT BY APPLICANT

(use as many sheets as necessary)

Sheet 2 of

Complete if Known

Application Number: 10/005,314
Filing Date: December 7, 2001
First Named Inventor: Amit BARUCH, et al.
Group Art Unit: 2655
Examiner Name: B. Albertall
Attorney Docket Number: 1326-US

OTHER PRIOR ART - NON PATENT LITERATURE DOCUMENTS


Examiner   Cite  Include name of the author (in CAPITAL LETTERS), title of the article (when appropriate), title of the        T
Initials*  No.   item (book, magazine, journal, serial, symposium, catalog, etc.), date, page(s), volume-issue number(s),
                 publisher, city and/or country where published.

                 Savaraj I. Pawate, Peter Ehlig, "Dialing a Phone by Voice", Machine Design, Jan. 10, 1991

                 Thomas B. Schalk, "Voice Recognition in Cellular Mobile Phones", Speech Technology, Sep./Oct. 1986


Examiner Signature:                    Date Considered:


*EXAMINER: Initial if reference considered, whether or not citation is in conformance with MPEP 609. Draw line through citation if not in conformance and not
considered. Include copy of this form with next communication to applicant. 



1 Unique citation designation number 2 Applicant is to place a check mark here if English language Translation is attached. 



Burden Hour Statement: This form is estimated to take 2.0 hours to complete. Time will vary depending upon the needs of the individual case. Any comments on 
the amount of time you are required to complete this form should be sent to the Chief Information Officer, U. S. Patent and Trademark Office, Washington, DC 20231. 




AUTOMOTIVE

DIALING A PHONE BY VOICE

SAVARAJ I. PAWATE
PETER EHLIG

Technical Staff Members
Texas Instruments Inc.
Houston, TX






[...] convenience and safety benefits [...] for speech recognition systems
[...] car phones. A voice dialer demonstration unit is contained in a
briefcase-size box. After initial program installation, the dialer plugs
into a cigar lighter receptacle. Correct operation is verified [...]
another telephone to a [...] vehicle.



Soon you may be able to "dial" a car phone and turn
on the lights and wipers with voice commands.



Look for speech recognition to be the next hot technology in the burgeoning
automotive electronics industry. In fact, some experts expect voice command
systems that control vehicle functions to become widely accepted in this decade.

One application getting a lot of attention today is a speech recognition voice
dialer for cellular car phones. Voice-activated telephone dialing allows the driver
to keep his eyes on the road and at least one hand on the wheel. Conventional
dialers, in contrast, require operators to look at a keypad to punch in numbers, a
dangerous activity in moving vehicles.

The voice dialer recognizes both male and female voices, as well as a number of
dialects. It can have a vocabulary of 25 or more words, depending on memory size.
Surprisingly, all this functionality requires only one digital signal processor
(DSP).

The voice dialer employs a speech recognition algorithm known as continuous
density Hidden Markov Modeling (HMM). HMMs are statistical models for
vocabulary words. The algorithms devised to decode voice patterns require
substantially more computing power than other techniques, but the improved
recognition accuracy outweighs any added expense incurred by using bigger
microprocessors.
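
As a rough illustration of what scoring an utterance against per-word HMMs can look like, the Python sketch below picks the vocabulary word whose model gives the best score. Everything in it is an assumption made for illustration and not taken from the article: the one-dimensional features, the toy means and variances, the left-to-right topology, the `stay` probability, the names `viterbi_log_score` and `recognize`, and the choice of Viterbi (best-path) scoring. The article does not describe the dialer's actual models or decoder.

```python
import math

def log_gauss(x, mean, var):
    """Log density of a 1-D Gaussian: a 'continuous density' emission."""
    return -0.5 * (math.log(2.0 * math.pi * var) + (x - mean) ** 2 / var)

def viterbi_log_score(frames, model):
    """Best-path log likelihood of `frames` under a left-to-right HMM.

    `model` is a hypothetical dict {"means": [...], "vars": [...], "stay": p}:
    each state repeats with probability p or advances with probability 1 - p.
    """
    means, variances, stay = model["means"], model["vars"], model["stay"]
    n = len(means)
    neg_inf = float("-inf")
    # The path is required to start in the first state.
    score = [neg_inf] * n
    score[0] = log_gauss(frames[0], means[0], variances[0])
    for x in frames[1:]:
        new = [neg_inf] * n
        for s in range(n):
            best = score[s] + math.log(stay)            # stay in state s
            if s > 0:                                    # or advance from s - 1
                best = max(best, score[s - 1] + math.log(1.0 - stay))
            new[s] = best + log_gauss(x, means[s], variances[s])
        score = new
    # The path is required to end in the last state.
    return score[-1]

def recognize(frames, word_models):
    """Return the word whose model scores the frames highest."""
    return max(word_models, key=lambda w: viterbi_log_score(frames, word_models[w]))

# Toy models: made-up numbers, not real acoustic parameters.
WORD_MODELS = {
    "call": {"means": [0.0, 2.0, 4.0], "vars": [1.0, 1.0, 1.0], "stay": 0.5},
    "home": {"means": [4.0, 2.0, 0.0], "vars": [1.0, 1.0, 1.0], "stay": 0.5},
}

if __name__ == "__main__":
    print(recognize([0.1, 0.2, 1.9, 2.1, 3.8, 4.2], WORD_MODELS))
```

The inner loop runs once per frame, per state, per word model, which gives a feel for why this family of algorithms needs more computing power than simpler template comparisons.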

The voice recognition system has a speaker-independent mode, which means a
person does not have to train it to learn his or her voice. For example, any
rental-car customer can use the dialer. Any American speakers, regardless of
their accents, can be accommodated. Continuous speech recognition is employed
so the speaker can talk naturally; no deliberate pauses between words are
required.

In addition to unsurpassed accuracy, the voice dialer solves a related
communications problem. The cellular telephone industry is rapidly running out of
available channels because of the demand for such service. However, a new
algorithm called Vector Sum Excited Linear Predictive (VSELP) speech coding
allows the phone system to accommodate more channels in the available bandwidth
than previous methods.

Using the dialer

A typical application uses a grammar definition program built into, or
downloaded to, the DSP memory, so either a man or woman can speak to a car
telephone and say "Call office" or "Call home." He or she can also state the
number to be called using the words one through nine for digits or the word "oh"
for zero. The user can also define a repertory name, for example, "Call Har[...]."

Speech recognition technology [...] speaker verification [...] security was
introduced [...] several years after research began in the 1960s. Even then, the
system was said to be superior to fingerprint identification. It allowed a [...] of
the system to control entry to its computer center.

Today, speech [...] development systems are designed for a variety of
applications [...] telephone management, language recognition and speaker
verification [...] are now widely used.

• Text-to-speech algorithms convert ASCII text (as it appears on a computer
monitor) into spoken English. The computer-generated voice is [...] intelligible,
and has an unlimited vocabulary. Specific applications include [...] order entry
input, and [...].

• Record/playback applications are similar to tape recorders or dictation
machines. The user can record notes, speeches and other material. However,
computer storage provides greater fidelity than magnetic recordings and enables
the recorded file to be [...].

• Telephone management systems employ computers to answer telephones, [...]
simple voice mail. In computer banking, for example, customer transactions [...]
can be confirmed at each step of the process by a synthesized voice.

• Language recognition enables a computer to recognize complete sentences as
they are spoken. One system, for example, can handle applications requiring up to
2,000 words. Language recognition goes beyond mere word recognition; entire
sentences are analyzed using context analysis to help determine what is spoken.
Language recognition is particularly useful in applications where keyboards [...]

[...] speech and semiconductor technology, and speech algorithms. [...]
established multiple speech databases help create [...] for the digits used in the
voice dialer.

Speech application development requires special software and hardware tools
and utilities, and run-time [...] and Unix platforms. For example, a speech system
toolkit (Speech System [...]) is available for [...]. The toolkit also contains
[...] interface [...] systems operating [...].

The heart of the dialer comprises fixed-point DSPs, a ROM-based design
particularly suited for cellular phones. The DSP has a number of built-in hardware
features that speed the implementation of speech recognition algorithms.
Consequently, the phones make full use of state-of-the-art digital technology to
maximize available telephone channel bandwidth.

Voice dialing features can be added to cellular telephones by simply increasing
system memory; other DSP devices are not required. The single speech coding
DSP can be time shared to handle voice recognition as well because both
functions do not need to run simultaneously. Further, integrated cellular
telephones can use the same DSP to control other functions, such as vehicle
entertainment equipment, climate, and windshield wipers.

Voice dialer ROM and RAM combinations can be varied to handle different size
boot programs, program memory, and data. The programs differ depending on the
number of telephony applications and the functions provided. An analog interface
to the telephone handset, an alphanumeric display, and interrupt-driven
connections to the telephone handset complete the setup.

New product development

To aid in the design of new speech recognition products, the dialer doubles as a
development system. An RS-232 interface, for example, supports downloading
external software and provides a [...] for control and input information [...]
other systems associated with the [...]. As a result, the voice dialer is easily
integrated into a specific application environment or another development system
and evaluated.

The port downloads to a separate 64k [...] in the voice dialer. The boot
program transfers the downloaded program and data to the correct DSP memory.

The dialer has uses other than the [...] phone application. They include personal
computers or workstations where recognition is used instead of keyboard input.
Also, voice input can supplement factory automation and process inspection data
for various machines and [...]computers.

A speech recognition system can also provide hands-off control of a vehicle
entertainment system, climate control, windows, windshield wipers, and door locks.
For example, a driver can select a radio station with his voice or change the
interior temperature without removing his hands from the steering wheel. The voice
system can also query the vehicle for fuel status and mpg ratings. Even more [...]
features can be had at negligible cost, such as a voice lock that allows the
vehicle to be started only by authorized persons.

A demonstration voice dialer system is contained in a portable, briefcase-size
enclosure. It is powered by either a 220/110-Vac supply or 12 Vdc through a vehicle
cigar lighter receptacle. Such a portable voice dialer can be used as a development
system or a test set to diagnose faults in telephones in other mobile units.
The voice dialer circuit is located on one printed-circuit board with
programmable array logic (PAL) to minimize the number of individual support logic
chips. Voice dialer subsystems include analog [...] and codec, processor and RAM
memory, processor control and EEPROMs, display and communications port, and [...]


Application-specific grammar

An algorithm can be loaded that makes the dialer recognize up to 25 words without
discriminating male or female voices. An application-specific grammar can be
downloaded to the system through the RS-232 serial port, or installed at the
factory.

A grammar is also called a sentence model. The DSP and speech recognition
algorithms understand and respond to sentence models, and control the syntax
in which the words are put together.

After the grammar is loaded, the voice system recognizes the following
sequence of commands spoken in any order: call office, call home, or number
(digits). In this sequence, number is a digit string of any length; for
example, number 555-7777 is a legal sentence. A 1-s pause (or other
adjustable value) terminates any speech. When the voice dialer recognizes a
complete phrase followed by the pause, it displays a period (.) on the voice
dialer 32-character alphanumeric liquid-crystal display screen. The commands
'enter' or 'cancel' can also terminate the connection.
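
The sentence model described above can be sketched in a few lines of Python.
This is only an illustration under stated assumptions: the function name
parse_sentence and the word-to-digit table are hypothetical, and the sketch
checks word sequences that have already been recognized; it performs no
speech recognition and is not the dialer's actual algorithm.

```python
# Hypothetical sketch of the dialer's sentence model (grammar).
# Assumption: digit words are zero through nine plus "oh" for zero.
DIGITS = {"zero": "0", "oh": "0", "one": "1", "two": "2", "three": "3",
          "four": "4", "five": "5", "six": "6", "seven": "7",
          "eight": "8", "nine": "9"}

def parse_sentence(words):
    """Return ("call", place) or ("number", digits); None if not a legal sentence."""
    # "call office" and "call home" are the two fixed call sentences.
    if len(words) == 2 and words[0] == "call" and words[1] in ("office", "home"):
        return ("call", words[1])
    # "number" must be followed by a digit string of any (non-zero) length.
    if len(words) >= 2 and words[0] == "number" and all(w in DIGITS for w in words[1:]):
        return ("number", "".join(DIGITS[w] for w in words[1:]))
    return None

print(parse_sentence("number five five five".split()))
print(parse_sentence("call home".split()))
```

A sentence outside the grammar, such as "call John", returns None in this
sketch, which corresponds to the need to download a different
application-specific grammar before names can be recognized.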

Pressing the off-hook switch on either the voice dialer case or the handset
restarts the voice recognition process. In fact, the system recognizes just
one command each time the phone goes off hook.

Other application grammars also are possible. An application may, for
example, require that the speech recognition system recognize names and the
word call, as in the command call John Jones.

A basic voice dialer vocabulary consists of 11 digits (zero through nine and
the word oh for zero) and four words (call, office, home, and number). But
other words are easily added to the application grammar. In one version of
the dialer, other common words used are enter, cancel, area, code, extension,
and emergency.

The database connection

Many speaker-independent word models were created for the voice dialer to
eliminate a training phase needed by earlier speech recognition systems. By
collecting speech samples from 200 native American speakers (100 male and
100 female), statistical models for each vocabulary word were created. Thus,
the likelihood of an unrecognizable word was largely diminished. Care was
taken to sample different geographical regions to reflect various dialects.
The repertory of voice information is archived in a speech database.

Recognizing that different accents need to be accommodated in certain
applications, a speaker-adaptive operating mode was developed. In this mode,
the voice dialer boots up with a speaker-independent model. The models act as
a "seed," and the voice dialer controlling algorithm continuously adapts the
model to the user in what is called a voice dialer training mode.

Many novel applications also are possible using the database concept. For
example, a vocabulary may be developed that is specific to one automobile
manufacturer or customer. For some applications, such as a personalized car
phone that is disabled when others try to use it, TI can supply
speaker-dependent capability with a code word.

In the present voice dialer, all necessary voice recognition functions, such
as [illegible] algorithms, signal processing, and grammar control, are
performed by one DSP. For more complex applications, however, such as large
vocabularies and more complex grammars, more than one DSP may be needed.
Multiprocessor architecture allows algorithm partitioning so that larger
vocabularies may be recognized and accommodated.

Experimental versions of a multiprocessor DSP architecture for speech
recognition have already been made. As many as 32 DSPs were connected in a
system which, at present, uses an IBM AT computer as host for development
and input/output functions.
Voice Recognition in Cellular Mobile Telephones

A speaker-independent system has been developed which allows "hands-free"
telephone use



Thomas B. Schalk
Director of Technology Development
Voice Control Systems
Dallas, TX



[...] marked the beginning of high quality mobile telephone service. The
first mobile cellular service was offered in Chicago during 1983 and now over
100 major cities offer the service. Accompanying this has been the
proliferation of 500,000 mobile telephone users, all of whom face the
challenge of manipulating their phones in a mobile environment.

A typical cellular telephone has approximately 20 buttons; these correspond
to the 10 digits and various control functions. Dialing telephone numbers
while driving can be dangerous because the user will typically take his or
her eyes off the road to manipulate the telephone keypad. Even after the
phone number has been dialed, the user must hold the handset, which makes
shifting gears and using turn signals, among other things, difficult to do.
Hence, many cellular phones have an optional remote microphone that is
mounted near the visor, and a speaker located somewhere near the driver.
After a call has been placed, the remote microphone and speaker are used in a
"hands-free" fashion.

The voice-dialing mobile cellular telephone is one of the most exciting and
promising applications of speech recognition in telephony. The use of voice
input for dialing can alleviate many of the safety problems associated with
cellular telephone systems. A speaker-independent voice recognition system
for cellular phones has been developed. The voice control unit is designed to
operate optimally in driving vehicles. To develop this system, an extensive
voice data collection took place.

In this article the performance requirements for this application will be
considered first, then data collection procedures and the functional
capabilities of the recognizer will be discussed.

Recognition Performance Requirements

The recognition technology used in the voice control unit for the cellular
telephone is speaker-independent and operates on isolated speech. For
operating the phone by voice, in place of the keypad, only a small fixed
vocabulary is needed. The primary advantage of using speaker-independent
technology for the cellular application is that the recognizer need not be
trained by each user. Since the noise encountered during driving conditions
varies tremendously, it is not practical to expect users to train recognizers
properly for the mobile environment. For example, should the user train the
system with the engine on or off? Should the blower be set at low, medium, or
high? On what road surfaces should the user drive during training, and at
what speeds? The speaker-independent reference data used for development of
the voice control unit were collected under a wide variety of conditions,
which accounts for the robustness of the resultant recognizer in various
mobile environmental noises.

It should be noted, though, that a speaker-dependent capability would be
useful because it offers vocabulary flexibility. This would allow the user to
create custom vocabularies that include people's names to facilitate speed
dialing. For example, a person could simply pick up the handset and say
"speed-dial Bob Smith" and the preprogrammed phone number for Bob Smith would
be dialed automatically.

SPEECH TECHNOLOGY, SEPT/OCT 1986

Commercially available speech recognizers exhibit a wide range of recognition
performance. To assess performance, recognition accuracy of the system must
be measured. In a crude sense, recognition accuracy refers to the percentage
of the time that the recognizer correctly classifies an input utterance. It
depends on a number of factors, such as whether the system is
speaker-dependent or speaker-independent, whether the system is a discrete or
connected word recognizer, the difficulty of the vocabulary, the cost of the
system, and the environment in which the system is used. The most stringent
performance requirement for the cellular application is reliable digit
recognition. Since typical phone numbers are 7 digits long, individual digit
accuracy must be very high to make dialing phone numbers by voice practical.
Therefore, the ability to detect recognition errors and correct them is
critical.

There are three types of errors that a recognizer can make. One, the most
obnoxious, is called a substitution error. A substitution occurs when an
incorrect word is hypothesized for a valid input utterance. For example, if
the active vocabulary for a recognizer includes digits and a "two" is
hypothesized when a "nine" was actually spoken, then the recognizer is said
to have substituted a two for a nine. In general, substitution error rates
must be less than 2 percent for user acceptance. The speaker-independent
technology developed for the cellular telephone application achieves this
performance goal based on measurements from a large data base of speakers
who were "naive" and "inexperienced" recognizer users. The "experienced" user
tends to incur much lower error rates, making the successful recognition of
seven consecutive digits quite likely.

(Figure caption: Voice dialing can alleviate many of the safety problems of
cellular telephone use.)
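
To make the seven-digit requirement concrete, the following short Python
sketch computes the chance that a whole phone number is recognized without a
substitution. It assumes, as a simplification not stated in the article, that
errors on successive digits are independent; the function name
string_accuracy is hypothetical.

```python
# Sketch under an independence assumption: each digit is recognized
# correctly with the same probability, independently of the others.
def string_accuracy(per_digit_accuracy, n_digits=7):
    """Probability that all n_digits digits are recognized correctly."""
    return per_digit_accuracy ** n_digits

# Compare a few per-digit substitution rates for a 7-digit number.
for substitution_rate in (0.02, 0.01, 0.005):
    p = string_accuracy(1.0 - substitution_rate)
    print(f"substitution rate {substitution_rate:.1%}: "
          f"7-digit success {p:.1%}")
```

Under this assumption, a 2 percent per-digit substitution rate leaves roughly
an 87 percent chance of getting all seven digits right on the first attempt,
which is why the ability to verify and correct the digit string matters.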

The second type of recognition error is a rejection error. This occurs when a
valid input utterance is not classified by the recognizer. When rejections
occur, the user simply repeats the utterance, ideally one time, until it is
recognized. Rejection errors are not as serious as substitutions, but should
not occur more than [figure illegible] percent of the time for user
acceptance.

The third type of recognition error is a spurious response error. This occurs
when an invalid input "sound" (such as a horn honking or uttering a word not
found in the vocabulary) is classified as a vocabulary word. Ideally a
recognizer should reject all spurious input. Unfortunately, none of today's
recognizers is able to do so. Spurious responses can be minimized by either
using push-to-talk microphone arrangements or close-talk microphones. The
cellular phone recognition system described here was designed to work under
close-talk conditions with a cellular handset. An experimental system using a
"far-talk" microphone has also been developed, and it uses a push-to-talk
microphone activation system to minimize spurious errors.

It is not feasible to quantitatively measure the spurious response error rate
for a given recognizer. To do so would involve collecting a database of all
possible sounds that can occur. Nevertheless, if such a data base did exist,
spurious response error rates of 50 percent would not be surprising for a
typical recognizer. This means that about half the time a spurious sound
occurs that is loud enough, the recognizer will attempt to classify it as a
word in the vocabulary.
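
The three error types can be tallied from a labeled test log, as in the
Python sketch below. The record format (a pair of the spoken word and the
recognizer's hypothesis, with None standing for "not a vocabulary word" or
"rejected") and the function names are assumptions made for illustration.
Dividing substitution and rejection counts by the number of valid inputs, and
spurious responses by the number of invalid inputs, is also an assumed
convention rather than one stated in the article.

```python
from collections import Counter

def classify_trial(spoken, hypothesized):
    """Label one trial.

    spoken: the vocabulary word actually said, or None for an invalid sound.
    hypothesized: the recognizer's output, or None if it rejected the input.
    """
    if spoken is None:
        return "correct_rejection" if hypothesized is None else "spurious"
    if hypothesized is None:
        return "rejection"
    return "correct" if hypothesized == spoken else "substitution"

def error_rates(trials):
    """Return substitution, rejection and spurious response rates."""
    counts = Counter(classify_trial(s, h) for s, h in trials)
    valid = counts["correct"] + counts["substitution"] + counts["rejection"]
    invalid = counts["spurious"] + counts["correct_rejection"]
    return {
        "substitution": counts["substitution"] / valid if valid else 0.0,
        "rejection": counts["rejection"] / valid if valid else 0.0,
        "spurious": counts["spurious"] / invalid if invalid else 0.0,
    }

# Made-up illustrative log, not data from the article.
log = [("nine", "two"), ("nine", "nine"), ("one", None),
       ("two", "two"), (None, "six"), (None, None)]
print(error_rates(log))
```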
(Figure caption: This data collection process took about 10 minutes per
individual.)

Data Collection Procedure



Speech recognizers generally exhibit sensitivity to changes in the
environments in which they are used. The noise characteristics of moving
vehicles vary dramatically depending on the driving conditions and the
characteristics of the vehicle itself, say a luxury compared to an
inexpensive small car. To develop the speaker-independent reference data for
the voice control unit, a large data-base collection was conducted in moving
vehicles. Data samples were collected from over 500 people, approximately
100 speakers each in 5 different automobiles selected to span a wide range of
driving noise. For safety reasons, the voice donor sat on the passenger side
during data collection.

There were two phases to the data collection. The first phase involved the
collection of data from a tethered ("far-talk") microphone mounted near the
visor area. During the second phase, samples were collected through a
cellular telephone handset. For each phase of the collection, the voice donor
was instructed to say words as they appeared on a custom built prompting
display located on the passenger's side of the dashboard. The voice donors
were instructed to say the words quickly, in an authoritative manner.

Immediately after the far-talk portion of the collection was completed, the
volunteer was again instructed to speak the words as they appeared on the
monitor, but this time they were to be said into the cellular handset as
though they were "conversing" with it. The entire data collection process
took approximately 10 minutes per individual.

The speech samples were recorded using portable digital audio processing
recording equipment. This equipment consisted of a SONY Portable VCR SL-2000
and a SONY PCM-F1 Digital Audio Processor (Fig. 1). A small portable computer
was used to control the recording equipment and was programmed to feed the
speech vocabulary prompts to the prompting display located on the passenger
side of the dashboard. In addition, the tapes were automatically marked to
indicate speaker number, speech track boundaries, and prompting information.
Since only the video portion of the tape was used for PCM speech recording,
the normal audio track was available for writing ASCII coded DTMF tones to
code this information. This allowed for unsupervised digitization of the
speech data onto the VAX computer for speaker-independent vocabulary
development. The vocabulary words collected through the handsets included
the following list of words:



 1. one           15. send
 2. two           16. cancel
 3. three         17. clear
 4. four          18. verify
 5. five          19. spouse
 6. six           20. home
 7. seven         21. friend
 8. eight         22. work
 9. nine          23. office
10. zero          24. school
11. oh            25. service
12. dial          26. information
13. recall        27. airline
14. speed-dial    28. emergency



During the data-base collection, an emphasis was placed on obtaining a
reasonable distribution of different dialects, and on collecting an equal
number of male and female voices. As stated earlier, five different cars were
used in the collection, as well as a variety of handsets. Most of the speech
was collected in a moving vehicle under many different environmental
conditions. Some of these conditions were rain with windshield wipers on,
defroster on, air conditioner set at various levels, heater set at various
levels, windows open, and a radio playing in the background. Furthermore, the
data samples were collected in many different locations throughout the
Dallas/Ft. Worth Metroplex, yielding driving conditions that differed greatly
from one location to the next. Prior to the development of the
speaker-independent reference data, all of the collected speech was audited
and documented. Special attention was given to dialect variations, age of
speaker, and any extraordinary characteristics associated with either the
speaker or the driving conditions at the time of the collection.

Cellular Telephone Voice Control Unit

The cellular telephone voice control unit discussed here has some noteworthy
characteristics. The voice recognition technology is speaker-independent,
thus there is no user training required. This specialized recognition system
was designed specifically for the noise characteristics of the mobile
communications environment. It is a software-based recognizer that requires
a single general purpose microprocessor (Intel 80186) for implementation. The
recognition circuit interfaces to the mobile telephone through the bus that
connects the phone control unit (located near the vehicle driver) to the
transceiver unit (mounted in the vehicle trunk) as indicated in Fig. 2.

The functional operation of the voice unit centers around syntactically
structured voice commands from the user, and voice responses from the voice
control unit. The command syntax structure is illustrated in Fig. 3. This
simple scheme for voice dialing involves a recognition vocabulary of only 28
words (listed earlier). The output channel of the CODEC in the voice
recognizer front-end is used for voice responses to guide the user and
provide aural feedback for validation of input. As illustrated in Fig. 2, the
voice control unit taps into the voice channel and control interface on the
control bus of the cellular telephone system. The voice control unit
recognizes voice commands given to the phone and then issues appropriate
commands to operate the telephone.

Each voice command to the phone is acknowledged by the voice control unit
through an aural response. If the command is recognized, then a short beep
tone signifies to the user that the voice control unit has recognized the
command and is ready for the next one. If the measured signal-to-noise ratio
of a detected utterance is below 20 dB, the utterance is rejected and the
voice control unit asks the user to "speak louder." If an utterance has an
acceptable signal-to-noise ratio, but is not identified with sufficient
confidence, it is rejected, and the response "repeat" is issued to the user,
indicating he or she should re-enter the command.
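
The acknowledgement rules just described can be sketched as a small decision
function in Python. Only the 20 dB signal-to-noise floor comes from the text.
The confidence scale and its threshold (CONFIDENCE_FLOOR), the use of a power
ratio to compute the signal-to-noise ratio, and the function names are
hypothetical choices for illustration, not the unit's actual implementation.

```python
import math

SNR_FLOOR_DB = 20.0      # rejection threshold stated in the article
CONFIDENCE_FLOOR = 0.8   # hypothetical; the article gives no figure

def snr_db(signal_power, noise_power):
    """Signal-to-noise ratio in decibels from two power measurements."""
    return 10.0 * math.log10(signal_power / noise_power)

def acknowledge(signal_power, noise_power, confidence):
    """Choose the aural response for one detected utterance."""
    if snr_db(signal_power, noise_power) < SNR_FLOOR_DB:
        return "speak louder"   # utterance too quiet relative to the noise
    if confidence < CONFIDENCE_FLOOR:
        return "repeat"         # loud enough, but not identified confidently
    return "beep"               # recognized; ready for the next command

print(acknowledge(50.0, 1.0, 0.99))    # about 17 dB
print(acknowledge(1000.0, 1.0, 0.50))  # 30 dB, low confidence
print(acknowledge(1000.0, 1.0, 0.95))  # 30 dB, high confidence
```

Checking the signal-to-noise ratio before the confidence score mirrors the
order in which the article states the two rejection conditions.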

To dial phone numbers, the user simply says "dial" followed by a string of
digits. After speaking the last digit of the telephone number, the user says
"verify" and a voice response system is activated which repeats the
recognized digit sequence through the earpiece of the handset. The user then
says "send" in order for that number to be dialed, or "clear" if an improper
digit sequence occurred, which activates the top-node vocabulary (dial,
recall, speed-dial) and "ready" is synthesized.

For memory dialing (preprogrammed phone numbers), the user says "recall" and
then utters a one or two digit sequence, depending on how many preprogrammed
numbers the cellular telephone can store. The recognized sequence is repeated
and the corresponding phone number in memory is dialed if the word "send" is
spoken and recognized. Speed-dialing is achieved by simply picking up the
handset and saying "speed dial" followed by one of the ten destination
descriptors such as "home," "office," "friend," etc. The recognized words are
then repeated and the user can activate the call by saying "send."
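
The three command paths (dial, recall, speed-dial) can be sketched as a small
interpreter over already-recognized words. This is a hedged illustration in
Python, not the unit's software: the function name interpret, the memory and
speed_dial lookup tables, and the phone numbers in the usage lines are
hypothetical, and the sketch assumes that "verify" is always spoken before
"send" on the dial path.

```python
# Hypothetical sketch of the voice command syntax described in the text.
DIGITS = {"zero": "0", "oh": "0", "one": "1", "two": "2", "three": "3",
          "four": "4", "five": "5", "six": "6", "seven": "7",
          "eight": "8", "nine": "9"}
DESCRIPTORS = {"spouse", "home", "friend", "work", "office",
               "school", "service", "information", "airline", "emergency"}

def interpret(words, memory=None, speed_dial=None):
    """Return the number to dial, or None if cleared, incomplete or illegal."""
    memory = memory or {}          # memory code -> stored phone number
    speed_dial = speed_dial or {}  # destination descriptor -> phone number
    if not words:
        return None
    head, rest = words[0], list(words[1:])
    if head == "dial":
        number = ""
        while rest and rest[0] in DIGITS:
            number += DIGITS[rest.pop(0)]
        # The unit would read the digits back after "verify".
        if not rest or rest.pop(0) != "verify":
            return None
    elif head == "recall":
        code = ""
        while rest and rest[0] in DIGITS and len(code) < 2:
            code += DIGITS[rest.pop(0)]
        number = memory.get(code, "")
    elif head == "speed-dial":
        if not rest or rest[0] not in DESCRIPTORS:
            return None
        number = speed_dial.get(rest.pop(0), "")
    else:
        return None
    # Only "send" places the call; "clear" or anything else abandons it.
    return number if number and rest == ["send"] else None

print(interpret("dial five five five one two one two verify send".split()))
print(interpret("speed-dial home send".split(), speed_dial={"home": "5550199"}))
```

Returning None for "clear" stands in for the return to the top-node
vocabulary; a fuller model would also emit the "ready" prompt at that point.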

Enhanced Cellular Telephone Use

The cellular telephone voice control unit achieves robust
speaker-independent speech recognition in a highly variable high-noise
environment. Performance measurements using the data base collected in the
mobile automotive environment yielded substitution error below 2 percent and
rejection rates below 5 percent. The system tested has been implemented in
relatively simple hardware which is a fraction of the cost of the cellular
telephone unit it enhances. The cellular telephone voice control unit stands
as one of the best examples of a fruitful application of speech recognition
technology. It achieves its high practical value by simplifying a critical
interface and thus significantly enhancing the safety of cellular telephone
use.

(Fig. 2. Voice-controlled mobile cellular telephone.)

(Fig. 3. Voice command structure for cellular telephone recognizer. Legible
labels: DIAL, digit sequence (0-9, oh); two-digit memory code; Send, Clear;
SPEED-DIAL, with Spouse, Home, Friend, Work, Office, School, Service,
Information, Airline, Emergency; Dial Phone. Caption: A 28 word vocabulary is
used for this voice-dialing system.)